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Abstract It is shown that Schrodinger's equation may be derived from three pos- 
tulates. The first is a kind of statistical metamorphosis of classical mechanics, a set 
of two relations which are obtained from the canonical equations of particle me- 
chanics by replacing all observables by statistical averages. The second is a local 
conservation law of probability with a probability current which takes the form of 
a gradient. The third is a principle of maximal disorder as realized by the require- 
ment of minimal Fisher information. The rule for calculating expectation values 
is obtained from a fourth postulate, the requirement of energy conservation in the 
mean. The fact that all these basic relations of quantum theory may be derived from 
premises which are statistical in character is interpreted as a strong argument in 
favor of the statistical interpretation of quantum mechanics. The structures of quan- 
tum theory and classical statistical theories are compared and some fundamental 
differences are identified. 
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1 Introduction 

The interpretation of quantum theory does neither influence its theoretical predic- 
tions nor the experimentally observed data. Nevertheless it is extremely important 
because it determines the direction of future research. One of the many contro- 
versial interpretations of quantum mechanics is the "statistical interpretation" or 
"ensemble interpretation" [1]. It presents a point of view, which is in opposition 
to most variants of the Copenhagen interpretation [5] , but has been advocated by 
a large number of eminent physicists, including Einstein. It claims that quantum 
mechanics is incomplete with regard to the description of single events and that 
all its dynamic predictions are of a purely statistical nature. This means that, in 
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general, a large number of measurements on identically prepared systems have to 
be performed in order to verify a (dynamical) prediction of quantum theory. 

The origin of the time-dependent Schrodinger equation is of course an essen- 
tial aspect for the interpretation of quantum mechanics. Recently a number of 
derivations of Schrodinger 's equation have been reported which use as a starting 
point not a particle Hamiltonian but a statistical ensemble. The basic assumptions 
underlying these works include special postulates about the structure of momen- 
tum fluctuations |14j . the principle of minimum Fisher information 31,41 , a linear 
time-evolution law for a complex state variable [24] , or the assumption of a classical 
stochastic force of unspecified form [20] . The work reported in this paper belongs 
to this class of theories, which do not "quantize" a single particle but a statistical 
ensemble. It is shown that Schrodinger's equation may be derived from a small num- 
ber of very general and simple assumptions - which arc all essentially of a statistical 
nature. In a first step an infinite class of statistical theories is derived, containing 
a classical statistical theory as well as quantum mechanics. In a second step quan- 
tum mechanics is singled out as "most reasonable statistical theory" by imposing 
an additional requirement. This additional requirement is the principle of maximal 
disorder as realized by the principle of minimal Fisher information. 

We begin in section [2] with a general discussion of the role of probability in 
physical theories. In section [3] the central 'statistical condition' (first assumption) 
of this work is formulated. The set of corresponding statistical theories is derived 
in section [5] In sections [4] and [7] structural differences between quantum theory 
and classical statistical theories are investigated. The quantum mechanical rule for 
calculating expectation values is derived from the requirement of conservation of 
energy in the mean in section [S] In sections HO the principle of maximal disorder is 
implemented and Fisher's information measure is derived in section [TOl Section [TT] 
contains a detailed discussion of all assumptions and results and may be consulted 
in a first reading to obtain an overview of this work; questions of interpretation 
of the quantum theoretical formalism are also discussed in this section. In the last 
section [T2l open questions for future research are listed. 



2 On probability 

With regard to the role of probability, three types of physical theories may be 
distinguished. 

1. Theories of type 1 are deterministic. Single events are completely described 
by their known initial values and deterministic laws (differential equations). 
Classical mechanics is obviously such a theory. We include this type of theory, 
where probability does not play any role, in our classification scheme because it 
provides a basis for the following two types of theories. 

2. Theories of type 2 have deterministic laws but the initial values are unknown. 
Therefore, no predictions on individual events are possible, despite the fact that 
deterministic laws describing individual events are valid. In order to verify a 
prediction of a type 2 theory a large number of identically prepared experi- 
ments must be performed. We have no problems to understand or to interpret 
such a theory because we know its just our lack of knowledge which causes the 
uncertainty. An example is given by classical statistical mechanics. Of course, 
in order to construct a type 2 theory one needs a type 1 theory providing the 
deterministic laws. 

3. It is possible to go one step further in this direction increasing the relative 
importance of probability even more. We may not only work with unknown 
initial values but with unknown laws as well. In these type 3 theories there 
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are no deterministic laws describing individual events, only probabilities can be 
assigned (so there is no need to mention initial values any more). 

Type 2 theories could also be referred to as classical (statistical) theories. Type 3 
theories are most interesting because we recognize here characteristic features of 
quantum mechanics. In what follows we shall try to make this last statement more 
definite. 

Comparing type 2 and type 3 theories, one finds two remarkable aspects. The 
first is a subtle kind of "inconsistency" of type 2 theories: If we are unable to know 
the initial values of our observables (at a particular time), why should we be able 
to know these values during the following time interval (given we know them at 
a fixed time). In other words, in type 2 theories the two factors determining the 
final outcome of a theoretical prediction - namely initial values and laws - are not 
placed on the same (realistic) footing. This hybrid situation has been recognized 
before; the term 'crypto-deterministic' has been used [29j to characterize classical 
statistical mechanics. Type 3 theories do not show this kind of inconsistency. 

The second observation is simply that type 2 and type 3 theories have a number 
of important properties in common. Both are unable to predict the outcome of 
single events with certainty; only probabilities are provided in both cases. In both 
theories the quantities which may be actually observed - whose time dependence 
may be formulated in terms of a differential equation - are averaged observables, 
obtained with the help of a large number of single experiments. These common 
features lead us to suspect that a general structure might exist which comprises 
both types of theories. 

Such a general structure should consist of a set of (statistical) conditions, which 
have to be obeyed by any statistical theory. In theories of this kind observables in 
the conventional sense do not exist. Their role is taken over by random variables. 
Likewise, conventional physical laws - differential equations for time-dependent ob- 
servables - do not exist. They are replaced by differential equations for statistical 
averages. These averages of the (former) observables become the new observables, 
with the time t playing again the role of the independent variable. In order to 
construct such general conditions one needs again (as with type 2 theories) a de- 
terministic (type I) theory as a "parent" theory. Given such a type 1 theory we 
realize that a simple recipe to construct a reasonable set of statistical conditions 
is the following: Replace all observables (of the type 1 theory) by averaged values 
using appropriate probability densities. In this way the dynamics of the problem is 
completely transferred from the observables to the probability distributions. This 
program will be carried through in the next sections, using a model system of clas- 
sical mechanics as parent theory. 

The above construction principle describes an unusual situation, because we are 
used to considering determinism (concerning single events) as a very condition for 
doing science. Nevertheless, the physical context, which is referred to is quite simple 
and clear, namely that nature forbids for some reason deterministic description of 
single events but allows it at least "on the average". It is certainly true that we 
are not accustomed to such a kind of thinking. But to believe or not to believe in 
such mechanisms of nature is basically a matter of intellectual habit. Also, the fact 
that quantum mechanics is incomplete does not necessarily imply that a complete 
theory exists; the opposite possibility, that no deterministic description of nature 
will ever be found, should also be taken into account. 

3 Statistical conditions 

We study a simple system, a particle in an externally controlled time-independent 
potential V(x), whose motion is restricted to a single spatial dimension (coordinate 
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x). We use the canonical formalism of classical mechanics to describe this system. 
Thus, the fundamental observables of our theory are x(t) and p(t) and they obey 
the differential equations 

where F[x) = \y e now create statistical conditions, associated with the 

type 1 theory (p}, according to the method outlined in the last section. We replace 
the observables x(t), pit) and the force field F(x{t)) by averages x, p and F, and 
obtain 

A* = L (2) 
dt m K ' 

^P^, (3) 

The averages in ((3J) are mean values of the random variables x or p; there is no 
danger of confusion here, because the symbols x(t) and p(t) will not be used any 
more. In |T]) only terms occur, which depend either on the coordinate or the momen- 
tum, but not on both. Thus, to form the averages we need two probability densities 
p(x,t) and w(p,t), depending on the spatial coordinate x and the momentum p 
separately. Then, the averages occurring in ([2]), ([3]) are given by 

dxp(x,t)x (4) 



dpw(p, t)p (5) 

TT7) ■:- / K dx P (x,t)^-. (6) 



Note that F{x) has to be replaced by F(x) and not by F(x). The probability 
densities p and w are positive semidefinite and normalized to unity. They are time- 
dependent because they describe the dynamic behavior of this theory. 

Relations JS]),©, with the definitions (J6j) are, to the best of my knowledge, 
new. They will be referred to as "statistical conditions" . There is obviously a formal 
similarity of ([2]) , ((3]) with Ehrenfest's relations of quantum mechanics, but the differ- 
ential equations to be fulfilled by p and w are still unknown and may well differ from 
those of quantum theory. Relations ©-@ represent general conditions for theories 
which are deterministic only with respect to statistical averages of observables and 
not with respect to single events. They cannot be associated to either the classical 
or the quantum mechanical domain of physics. Many concrete statistical theories 
(differential equations for the probability distributions) obeying these conditions 
may exist (see the next section). 

These conditions should be supplemented by a local conservation law of prob- 
ability. Assuming that the probability current is proportional to the gradient of a 
function S (this is the simplest possible choice and the one realized in Hamilton- 
Jacobi theory, see also section this conservation law is for our one-dimensional 
situation given by the continuity equation 

dp(x,t) | d p(x,t) dS(x,t) =Q 
dt dx m dx 
The derivative of S(x,t) defines a field with dimension of a momentum, 
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Eq. ([8]) defines a unique number p(x, t) for each value of the random variable x. In 
the next section we will discuss the following question: Are we allowed to identify 
the possible values of the random variable p occurring in Eq. §5§ with the values of 
the momentum field p(x, t) ? 



4 On random variables 

Introducing standard notions of probability theory, the fundamental sample space 
of the present theory is given by all possible results of position measurements, i.e. it 
may be identified with the set of real numbers R. This set R may also be identified 
with the possible values of a random variable "position measurement" (whose name 
should strictly speaking differ from x but we shall neglect such differences here). 
The basic probability measure which assigns a probability to each event (subspace 
of R) is given by p(x, t). According to standard probability theory the field p(x, t) 
defined by ([8]) is itself a random variable. We may consider it as a function of the 
random variable X (denoting "position measurement") or as a random variable 
defined independently on the fundamental event space R; it makes no difference. 
Its probability density is uniquely determined by p(x, t) and the function p(x,t). 
In order to avoid confusion of names it may be useful to denote the derivative of 
S(x, t) with respect to x by g{x,t) instead of p{x,t). Thus p{x,t) = g(x,t) and 
the notation p = g{x,t) indicates that a random variable p defined by the function 
g(x,t) exists (the time variable will sometimes be omitted for brevity). 

In order to study this important point further, we rewrite the standard result 
for the probability density of p — g(x,t) in a form more appropriate for physical 
considerations (a form apparently not easily found in textbooks on probability). 
For the simplest possible situation, a denumerable sample space with elements Xi, 
a probability measure P, and a invertible function g(x), the probability that an 
event Pi = g(xi) occurs is obviously given by W(pi) = P(g^ 1 (pi)). This result is 
the starting point to obtain w(p), the probability density of a continuous random 
variable p = g(x), which is defined by a non-invertible function g{x). It is given 
by [38] 



i=l 



dg-\ P ) 



dp 
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where g~ l {p) denotes the n(p) solutions (the number of solutions depends on p) of 
the equation p — g(x) = 0. Using a well-known formula for Dirac's delta function 6, 
applied to the case where the argument of 5 is an arbitrary function, Eq.@ may 
be rewritten in the form 

w(p, t) = J dx p(x, t)S L - , (10) 

where we came back to our original notation, writing down the t— dependencies of 
p and w and replacing g{x) by dS(x, t)/dx. 

The representation (|10p reveals very clearly a hybrid nature of random variables 
defined as (nontrivial) functions on the event space R. They are partly defined 
by a probabilistic quantity [namely p(x)] and partly by a deterministic relation 
[namely g(x)\. The deterministic nature of the latter is expressed by the singular 
(delta- function) shape of the associated probability. Such densities occur in classical 
statistics, i.e. in type 2 theories; Eq. (|10[) may obviously be obtained by performing 
an integration over x of the classical phase space probability density p(x,t)S(p — 
dS(x,t)/dx). Considered from an operational point of view, the hybrid nature of 
random variables may be described as follows. Deterministic predictions for random 
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variables p = g(x) arc impossible, as are deterministic predictions for the original 
variables x. But once a number x has been observed in an experiment, then the value 
of p = g(x) is with certainty given by the defining function g(x). If no such relation 
exists, this does not necessarily imply that x and p are completely independent. 
Many other more complicated ('nonlocal' or 'probabilistic') relations between such 
variables are conceivable. 

We formulated general conditions comprising both type 2 and type 3 theories. 
Thus, as far as this general framework is concerned we can certainly not dispense 
with the standard notion of random variables, which are basic ingredients of type 2 
theories; such variables will certainly occur as special (type 2) cases in our formalism. 
But, of course, we are essentially interested in the characterization of type 3 theories 
and the form of Eq. (flQ|) shows that the standard notion of random variable is not 
necessarily meaningful in a type 3 theory. Thus we will allow for the possibility of 
random variables which are not defined by deterministic relations of the standard 
type, as functions on the sample space. 

This situation leads to a number of questions. We may, e.g. ask: Can we com- 
pletely dispense with the the standard concept of random variables if we are dealing 
exclusively with a type 3 theory ? The answer is certainly no; it seems impossible 
to formulate a physical theory without any deterministic relations. In fact, a deter- 
ministic relation, corresponding to a standard random variable F(x), has already 
been anticipated in Eq. ([6]). If in a position measurement of a particle a number x 
is observed, then the particle is - at the time of the measurement - with certainty 
under the influence of a force F(x). Thus, an allowed class of deterministic rela- 
tions might contain "given" functions, describing externally controlled influences 
like forces F(x) or potentials V(x). 

There may be other standard random variables. To decide on purely logical 
grounds which relations of a type 3 theory are deterministic and which are not is 
not an obvious matter. However, one would suspect that the deterministic relations 
should be of an universal nature; e.g. they should hold both in type 2 and type 3 
theories. Further, we may expect that all relations which are a logical consequence of 
the structure of space-time should belong to this class. Such a quantity is the kinetic 
energy. In fact, for the currently considered nonrelativistic range of physics, the 
functional form of the kinetic energy can be derived from the structure of the Galilei 
group both in the mathematical framework of classical mechanics |19j and quantum 
mechanics |18j . We refer to the kinetic energy p 2 /2m as a standard random variable 
insofar as it is a prescribed function of p (but it is, because it is a function of p, not 
a standard random variable with respect to the fundamental probability measure 
p). Combining the standard random variables "kinetic energy" and "potential" we 
obtain a standard random variable "energy" , which will be studied in more detail 
in section [6l 

Thus, in the present framework, particle momentum will, in general, not be con- 
sidered as a standard random variable. This means that an element of determinism 
has been eliminated from the theoretical description. It seems that this elimina- 
tion is one of the basic steps in the transition from type 2 to type 3 theories. The 
functional form of the probability density w(p, t), and its relation to p(x, t), are one 
of the main objectives of the present study. According to the above discussion a 
measurement of position does no longer determine momentum at the time of the 
measurement. However the set of all position measurements [represented formally 
by the probability density p(x, t)] may still determine (in a manner still to be clari- 
fied) the set of all momentum measurements [the probability w(p,t)]. Interestingly, 
Torre [6], using a completely different approach, arrived at a similar conclusion, 
namely that the quantum mechanical 'variables' position and momentum cannot 
be random variables in the conventional sense. For simplicity we will continue to 
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use the term random variable for p, and will add the attributes " 'standard" ' or 
"nonstandard" if required. 

As a first step in our study of w(p, t), we will now investigate the integral equa- 
tion ^ and will derive a relation for w(p,t) which will be used again in section [5] 
In the course of the following calculations the behavior of p and S at infinity will 
frequently be required. We know that p(x, t) is normalizable and vanishes at infin- 
ity. More specifically, we shall assume that p(x, t) and S(x, t) obey the following 
conditions: 

^-0, far a -oo, (11) 

ox p ox at 

where A is anyone of the following factors 

dS dS fdS\ 2 

Roughly speaking, condition (|lip means that p vanishes faster than 1/x and S 
is nonsingular at infinity. Whenever in the following an integration by parts will 
be performed, one of the conditions (jll[) will be used to eliminate the resulting 
boundary term. For brevity we shall not refer to pip any more; it will be sufficiently 
clear in the context of the calculation which one of the factors in (fT2"]) will be referred 
to. 

We look for differential equations for our fields p, S which are compatible with ([2|)- 
([7]). According to the above discussion we are not allowed to identify (jHJl with the 
random variable p. Using ([7]) we replace the derivative with respect to t in ^ by 
a derivative with respect to x and perform an integration by parts. Then, |(5J) takes 
the form 

dS(x,t) _ 

axp{x,t) — — — =p. (13) 

Eq. (TiU)) shows that the averaged value of the random variable p is the expectation 
value of the field p(x, t). In the next section we shall insert this expression for p in 
the second statistical condition ([3]). More specific results for the probability density 
w(p,t) will be obtained later (in section [TU]). As an intermediate step, we now 
use (|13p and (JSJl to derive a relation for w(p,t), introducing thereby an important 
change of variables. 

We replace the variables p, S by new variables ipi, ip2 defined by 

_ S _ S 

ipi = y/p cos — ij) 2 = yfp sin — . (14) 

We may as well introduce the imaginary unit and define the complex field -0 = 
■01 + vip2. Then, the last transformation and its inverse may be written as 

= Vpe* 1 (15) 

P = n\ s = ±m±. (i6) 

We note that so far no new condition or constraint has been introduced; choosing 
one of the sets of real variables {p, S}, {ipi, ^2}, or the set {ip, if)*} of complex fields 
is just a matter of mathematical convenience. Using {ip,ip*} the integrand on the 
left hand side of (TiU)) takes the form 



The derivative of \ip\ 2 may be omitted under the integral sign and (fl3|) takes the 
form 

s d f°° 

dxip*-—ip = / dpw(p,t)p. (18) 
I ox J^^ 

We introduce the Fourier transform of tp, defined by 

i>( X ,t) = r dpmt)^ (19) 



<f>(p,t) = 4= dx^(x,t)e-^ x . (20) 

V 27T J-oo 

The constant s, introduced in Eq. (|14[) . has the dimension of an action, which means 
that p has the dimension of a momentum. Performing the Fourier transform one 
finds that the momentum probability density may be written as 

w(p,t) = -\^p,t)\ 2 + h(p,t), (21) 
s 

where the integral over ph(p, t) has to vanish. Using Parseval's formula and the fact 
that both p{x 1 1) and w(p, t) are normalized to unity we find that the integral of 
h(p, t) has to vanish too. 

Using the continuity equation ((7]) and the first statistical condition ([2]) we found 
two results [namely (I18|) and l|2ip ] which reduce for h(p, t) = to characteristic 
relations of the quantum mechanical formalism. However, the function h(p,t), as 
well as the probability density w(p,t) we are finally interested in, is still unknown, 
because the validity of the deterministic relation |5]| is not guaranteed in the present 
general formalism allowing for type 3 theories. In the next section the implications 
of the second statistical condition will be studied without using w(p,t). We shall 
come back to the problem of the determination of w(p, t) in section [7] 



5 Statistical theories 



We study now the implications of the second statistical condition ((3]). Using the 
variables p, S it takes the form 



d_ 

dt 



dxp 



OS 

dx 



dxp 



dV 
dx ' 



(22) 



if p is replaced by the integral on the l.h.s. of (fT3|) . Making again use of 0, we 
replace in (|22"]) the derivative of p with respect to t by a derivative with respect to 
x. Then, after an integration by parts, the left hand side of (|2"2"|) takes the form 



d_ 

df 



dx p 



dx 



dS _ 
dx 

J_dp 

2m dx 



dS_ 



d_dS_ 
p dx~~dt 



(23) 



Performing two more integrations by parts [a second one in (|23l) substituting the 
term with the time-derivative of S, and a third one on the right hand side of (|2"2"1) ]. 
condition Q takes the final form 
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Equation (|24|) can be considered as an integral equation for the real function L(x, t) 
defined by 

dS 1 fdS{x,t) x2 



L ^=m + ^\T^) +VM - (25) 

Obviously, (|24p admits an infinite number of solutions for L(x,t), which are given 
by 

dp{x, t) dQ 

-^- L{x > t] = to' (26) 

The function Q(x,t) in (|26[) has to vanish at x — » ±00 but is otherwise completely 
arbitrary. 

Equation (f2"6| , with fixed Q and L as defined by (|25p , is the second differential 
equation for our variables S and p we were looking for, and defines - together with 
the continuity equation ([7]) - a statistical theory. The dynamic behavior is completely 
determined by these differential equations for S and p. On the other hand, the 
dynamic equation - in the sense of an equation describing the time-dependence of 
observable quantities - is given by ([2]) and ([3]). 

From the subset of functions Q which do not depend explicitely on x and t we 
list the following three possibilities for Q and the corresponding L. The simplest 
solution is 

Q = 0, L = 0. (27) 

The second Q depends only on p, 

Q ~ p n , n>l, L ~ n// 1 " 1 . (28) 

The third Q depends also on the derivative of p, 

We discuss first (|27p . The statistical theory defined by (|2"7) consists of the con- 
tinuity equation (O and [see |25p ] the Hamilton-Jacobi equation, 



dS_ J_ f dS{x,ty 2 
dt 2m \ dx 



+ V(x,t)=0. (30) 



The fact that one of these equations agrees with the Hamilton-Jacobi equation does 
not imply that this theory is a type 1 theory (making predictions about individual 
events) . This is not the case; many misleading statements concerning this limit may 
be found in the literature. It is a statistical theory whose observables are statistical 
averages. However, Eq. Q30p becomes a type 1 theory if it is considered separately 
- and embedded in the theory of canonical transformations. The crucial point is 
that (|3"0|) does not contain p; otherwise it could not be considered separately. This 
separability - or equivalently the absence of p in (|30| - implies that this theory is 
a classical (type 2) statistical theory [32]. The function S may be interpreted as 
describing the individual behavior of particles in the given environment (potential 
V). Loosely speaking, the function S may be identified with the considered parti- 
cle; recall that S is the function generating the canonical transformation to a trivial 
Hamiltonian. The identity of the particles described by S is not influenced by sta- 
tistical correlations because there is no coupling to p in ([3H| . The classical theory 
defined by (J7J) and (|3"0|) may also be formulated in terms of the variables ip and 
ip* [ but not as a single equation containing only tp; see the remark at the end of 
section (J5J]. In this form it has been discussed in several works [33l[32ll30] . 
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All theories with nontrivial Q, depending on p or its derivatives, should be clas- 
sified as "non-classical" (or type 3) according to the above analysis. In non-classical 
theories any treatment of single events (calculation of trajectories) is impossible 
due to the coupling between S and p. The problem is that single events are nev- 
ertheless real and observable. There must be a kind of dependence (correlation of 
non-classical type, entanglement) between these single events. But this dependence 
cannot be described by concepts of deterministic theories like "interaction" . 

The impossibility to identify objects in type 3 theories - independently from the 
statistical context - is obviously related to the breakdown of the concept of standard 
random variables discussed in the last section. There, we anticipated that a standard 
random variable (which is defined as a unique function of another random variable) 
contains an element of determinism that should be absent in type 3 theories. In 
fact, it does not make sense to define a unique relation between measuring data - 
e.g. of spatial position and momentum - if the quantities to be measured cannot 
themselves be defined independently from statistical aspects. 

The theory defined by Eq. ([28]) is a type 3 theory. We will not discuss it in detail 
because it may be shown (see the next section) to be unphysical. It has been listed 
here in order to have a concrete example from the large set of insignificant type 3 
theories. 

The theory defined by Eq. (|29|) is also a type 3 theory. Here, the second statistical 
condition takes the form 



dS_ 
dt 



1 

2m 



dS_ 

dx 



h 2 1 d 2 Jp 
V — = 0, 



2m J~p dx 2 



(31) 



if the free proportionality constant in (|29|) is fixed according to h 2 /m. The two 
equations (0) and (|3 1 1) may be rewritten in a more familiar form if the transforma- 
tion (|16p (with s = ft) to variables ip, ip* is performed. Then, both equations are 
contained (as real and imaginary parts) in the single equation 



Hdip 
~i~dt 



h 2 d 2 ip 
2m dx 2 



Vif,, 



(32) 



which is the one-dimensional version of Schrodinger's equation |34j . Thus, quantum 
mechanics belongs to the class of theories defined by the above conditions. We 
see that the statistical conditions ([2]), Q comprise both quantum mechanical and 
classical statistical theories; these relations express a "deep-rooted unity" j35j of the 
classical and quantum mechanical domain of physics. 

Of course, Eq. (|3"2"j) may also be derived using the variables ip, ip* from the 
beginning. We give an outline of this calculation which is quite instructive in some 
respects. Using ip, ip* the continuity equation ([7]) takes the form 



dip 
~dt 



2m dx 2 



tW(x,t)ip 



ip* + c.c. = 0. 



(33) 



This looks similar to (I32p [the function W(x, t) is arbitrary and could be set equal to 
V(a;)] but (|3"3"|) is a one-component relation which yields only the real part of (f3"2"| . In 
a second step, Eq. ([3]) is rewritten in terms of ip, ip* and a number of rearrangements 
are performed, leading to 



, 1 1 dipip* | 
dx — — — < 

i iptp* ox [ 

dip s d 2 ip 
dt 2m dx 2 



8m 

■ i — ip 
s 



1 dip dipip* 
ip dx dx 



c.c. 



4§v 

dx dx 



c.c. 



(34) 
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Performing an integration by parts and using the boundary condition 



1 



( 



dx 



) 



2 



for x 



(35) 



one finds that the first bracket in (fM)) may be omitted under the integral sign. Then 
Eq. (|3~i|) implies that ([3]) is fulfilled provided the condition 



holds. Eqs. (|3U|) and (f3"3"| together imply (f3"2"| . Roughly speaking, the continuity 
equation and the second statistical condition provide the real and imaginary parts 
respectively of Schrodinger's equation. 

We found an infinite number of statistical theories which are all compatible 
with our basic conditions and are all on equal footing so far. However, only one 
of them, quantum mechanics, is realized by nature. This situation leads us to ask 
which further conditions are required to single out quantum mechanics from this set. 
Knowing such condition(s) would allow us to have premises which imply quantum 
mechanics. 

The above analysis shows that Schrodinger's equation (|3"2")l can be derived from 
the condition that the dynamic law for the probabilities takes the form of a sin- 
gle equation for ip (instead of two equations for ip and ip* as is t ne case for all 
other theories). Our previous use of the variables ip and ip* instead of S and p 
was entirely a matter of mathematical convenience. In contrast, this last condition 
presents a real constraint for the physics since a different number field has been 
chosen }22j . Recently, Schrodingers equation including the gauge coupling term has 
been derived [24j from this condition (which had to be supplemented by two fur- 
ther conditions, namely the existence of a continuity equation and the assumption 
of a linear time evolution law for ip). Of course, this is a mathematical condition 
whose physical meaning is not at all clear. This formal criterion will be replaced in 
section [S] by a different condition which leads to the same conclusion but may be 
formulated in more physical terms. 

6 Energy conservation 

In the last section [section ([5])] we derived a second differential equation (f2l)|) for our 
dynamical variables p and S. This equation has some terms in common with the 
Hamilton- Jacobi equation of classical mechanics but contains an unknown function 
Q depending on p and S; in principle it could also depend on x and t but this would 
contradict the homogeneity of space-time. We need further physical condition(s) to 
determine those functions Q which are appropriate for a description of quantum 
mechanical reality or its classical counterpart. 

A rather obvious requirement is conservation of energy. In deterministic theo- 
ries conservation laws - and in particular the energy conservation law which will be 
considered exclusively here - are a logical consequence of the basic equations; there 
is no need for separate postulates in this case. In statistical theories energy con- 
servation with regard to time-dependence of single events is of course meaningless. 
However, a statistical analog of this conservation law may be formulated as follows: 
"The statistical average of the random variable energy is time-independent" . In the 
present framework it is expressed by the relation 




(36) 




(37) 
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We will use the abbreviation E = T+V for the bracket where T denotes the first and 
V denotes the second term respectively. Here, in contrast to the deterministic case, 
the fundamental laws [namely ((2]), 0, 0] do not guarantee the validity of (|37|) . It 
has to be implemented as a separate statistical condition. In fact, Eq. (|37p is very 
simple and convincing; it seems reasonable to keep only those statistical theories 
which obey the statistical version of the fundamental energy conservation law. 

In writing down Eq. ()37|1 a second tacit assumption, besides the postulate of en- 
ergy conservation, has been made, namely that a standard random variable "kinetic 
energy" exists; this assumption has already been formulated and partly justified in 
the last section. This means, in particular, that the probability density w(p, t), which 
has been introduced in the statistical conditions d2J) , ([3j) to obtain the expectation 
value of p may also be used to calculate the expectation value of p 2 . This second as- 
sumption is - like the requirement of energy conservation - not a consequence of the 
basic equations (f2"o) , Q . The latter may be used to calculate the probability den- 
sity p but says nothing about the calculation of expectation values of p— dependent 
quantities. Thus, Eq. (137j) is an additional assumption, as may also be seen by the 
fact that two unknown functions, namely h and Q occur in (|37p . 

Eq. (|3T)l defines a relation between Q and h. More precisely, we consider variables 
p and S which are solutions of the two basic equations (|26|) and l|7|). where Q may 
be an arbitrary function of p, S. Using these solutions we look which (differential) 
relations between Q and h are compatible with the requirement (|37|) . Postulating 
the validity of (f3"Tj) implies certain relations (yet to be found in explicit form) be- 
tween the equations determining the probabilities and the equations defining the 
expectation values of p-dependent quantities (like the kinetic energy). 

In a first step we rewrite the statistical average of p 2 in (|3T[) using (|2Tj) . The 
result is 



dpw(p, t)p 2 



s d 



dp h(p, t)p 2 



(38) 



as may be verified with the help of (|20|) . Using 
first term in the bracket in (l37l) takes the form 



38|) and transforming to p. S, the 
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(39) 



The imaginary part of the integrand as well as the second derivative of p do not 
contribute to the integral. We next calculate the time derivative of the remaining 
two terms and perform an integration by parts. Then, the first part of (|37[) takes 
the form 



dr 

dt 



1 

2m 



dp_dS_ 

dx dx ^ dx 2 




(40) 



If we add the time derivative of V to (|40f we obtain the time derivative of E, as 
defined by the left hand side of (|57]) . In the integrand of the latter expression the 
following term occurs 
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The two brackets in (|4Tj) may be rewritten with the help of (JT]) and ([26 
term (|4"Tj) takes the much simpler form 



2 m 



dp 

Or 



dQdp 
dx dt 



Then, the 



(42) 



Using (|40p and (|4"2")) our statistical condition (|3~T)) is given by 

s 2 d 2 p 



dE 
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(43) 



0, 



Finally, combining the first two terms in (|43|) to a single one, we find that ([37 
implies the following integral relation between Q and h. 



dx 



dp_ 

dx 



dQ 

dx 



1 d 2 VP 



2m y/p dx 2 



dp 
dt 



dt 2m 



(44) 



Let us first investigate the classical solution. We may either insert the classical, 
"hybrid" solution (|10p for w(p,t) directly into Eq. (|3"T|) or insert h(p,t) according 
to (J2TJ with w(p, t) as given by (|10p in (|44p . to obtain 



dx 



dp 
dx 



1 dQdp 
dx dt 



= 0, 



(45) 



which implies dQ/dx = 0. Thus, the hybrid probability density (|10p leads, as ex- 
pected, to a classical (the equation for S does not contain terms dependent on p) 
statistical theory, given by the Hamilton- Jacobi equation and the continuity equa- 
tion. These equations constitute the classical limit of quantum mechanics which 
is a statistical theory (of type 2 according to the above classification) and not a 
deterministic (type 1) theory like classical mechanics. This difference is very im- 
portant and should be borne in mind. The various ambiguities |25j one encounters 
in the conventional particle picture both in the transitions from classical physics 
to quantum mechanics and back to classical physics, do not exist in the present 
approach. 

If we insert the quantum-mechanical result (|29p with properly adjusted constant 
in (144]) . we obtain 

2 

dph(p,t)^-=T 0l (46) 

where To is an arbitrary time- independent constant. This constant reflects the possi- 
bility to fix a zero point of a (kinetic energy) scale. An analogous arbitrary constant 
Vq occurs for the potential energy. Since kinetic energy occurs always (in all physi- 
cally meaningful contexts) together with potential energy, the constant To m ay be 
eliminated with the help of a properly adjusted Vq. Therefore, we see that - as far 
as the calculation of the expectation value of the kinetic energy is concerned - it 
is allowed to set h = 0. Combined with previous results, we see that h may be set 
equal to as far as the calculation of the expectation values of p n , for n = 0, 1,2 
is concerned. These cases include all cases of practical importance. A universal rule 
for the calculation of averages of arbitrary powers of p is not available in the present 
theory. The same is true for arbitrary powers of x and p. Fortunately, this is not 
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really a problem since the above powers cover all cases of physical interest, as far 
as powers of p are concerned (combinations of powers of x and p do not occur in 
the present theory and will be dealt with in a future work). 

It is informative to compare the present theory with the corresponding situation 
in the established formulations of quantum mechanics. In the conventional quan- 
tization procedure, which is ideologically dominated by the structure of particle 
mechanics, it is postulated that all classical observables (arbitrary functions of x 
and p) be represented by operators in Hilbert space. The explicit construction of 
these operators runs into considerable difficulties 37J for all except the simplest 
combinations of x and p. But, typically, this does not cause any real problems since 
all simple combinations (of physical interest) can be represented in a unique way by 
corresponding operators. Thus, what is wrong - or rather ill-posed - is obviously the 
postulate itself, which creates an artificial problem. This is one example, among sev- 
eral others, for an artificial problem created by choosing the wrong (deterministic) 
starting point for quantization. 

If we start from the r.h.s. of ((44)) and postulate h = 0, then we obtain agreement 
with the standard formalism of quantum mechanics, both with regard to the time 
evolution equation and the rules for calculating expectation values of p— dependent 
quantities. Thus, h — is a rather strong condition. Unfortunately, there seems be 
no intuitive interpretation at all for this condition. It is even less understandable 
than our previous formal postulate leading to Schrodinger's equation, the require- 
ment of a complex state variable. Thus, while we gained in this section important 
insight in the relation between energy conservation, time-evolution equation and 
rules for calculating expectation values, still other methods are required if we want 
to derive quantum mechanics from a set of physically interpretable postulates. 



7 Entropy as a measure of disorder ? 

How then to determine the unknown function h(p,t) [and w{p,t)\ ? According to 
the last section, all the required information on h(p,t) may be obtained from a 
knowledge of the term Q in the differential equation <PS|) for p(x,t). We shall try 
to solve this problem my means of the following two-step strategy: (i) Find an 
additional physical condition for the fundamental probability density p(x,t), (ii) 
determine the shape of Q [as well as that of h(p, t) and w(p, t)] from this condition. 

At this point it may be useful to recall the way probability densities are de- 
termined in classical statistical physics. After all, the present class of theories is 
certainly not of a deterministic nature and belongs fundamentally to the same class 
of statistical (i.e. incomplete with regard to the description of single events) theories 
as classical statistical physics; no matter how important the remaining differences 
may be. 

The physical condition for p which determines the behavior of ensembles in clas- 
sical statistical physics is the principle of maximal (Boltzmann) entropy. It agrees 
essentially with the information-theoretic measure of disorder introduced by Shan- 
non [36j . Using this principle both the micro-canonical and the canonical distribu- 
tion of statistical thermodynamics may be derived under appropriate constraints. 
Let us discuss this classical extremal principle in some detail in order to see if it can 
be applied, after appropriate modifications, to the present problem. This question 
also entails a comparison of different types of statistical theories. 

The Boltzmann-Shannon entropy is defined as a functional S[p] of an arbitrary 
probability density p. The statistical properties characterizing disorder, which may 
be used to define this functional, are discussed in many publications [4], |21j . Only 
one of these conditions will, for later use, be written down here, namely the so-called 
"composition law" : Let us assume that p may be written in the form p = p\pi where 
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Pi, i = 1, 2 depends only on points in a subspace Xi of our n— dimensional sample 
space X and let us further assume that X is the direct product of X\ and X^. Thus, 
this system consists of two independent subsystems. Then, the composition law is 
given by 

S[p 1 p 2 } = S^[p 1 } + S^[p 2 ], (47) 

where operates only on X^. 

For a countable sample space with events labeled by indices i from an index set 
/ and probabilities pi, the entropy is given by 

S[p]=-kJ2pi ] *Ph (48) 
iei 

where k is a constant. To obtain meaningful results the extrema of (|48p under appro- 
priate constraints, or subsidiary conditions, must be found. The simplest constraint 
is the normalization condition ^ pi = 1 . In this case the extrema of the function 

F[p,X} = -kJ2 ftlnpi + A (^pi-lj (49) 
iei Vie/ ) 

with respect to the variables pi, ..-Pn, A must be calculated. One obtains the rea- 
sonable result that the minimal value of F[p,X) is (one of the pi equal to 1, all 
other equal to 0) and the maximal value is klnN (all pi equal, pi — l/N). 

For most problems of physical interest the sample space is non-denumerable. A 
straightforward generalization of Eq. (|48|) is given by 



S[p] =-kl dxp{x)hip{x), (50) 

where the symbol x denotes now a point in the appropriate (generally n— dimensional) 
sample space. There are some problems inherent in the this straightforward transi- 
tion to a continuous set of events which will be mentioned briefly in the next section. 
Let us put aside this problems for the moment and ask if (|50p makes sense from 
a physical point of view. For non-denumerable problems the principle of maximal 
disorder leads to a variational problem and the method of Lagrange multipliers may 
still be used to combine the requirement of maximal entropy with other defining 
properties (constraints). An important constraint is the property of constant tem- 
perature which leads to the condition that the expectation value of the possible 
energy values E(x) is given by a fixed number E, 

dxp{x)E(x), (51) 

If, in addition, normalizability is implemented as a defining property, then the true 
distribution should be an extremum of the functional 

k \p\ = -k fdx p(x) In p(x) - A 2 / dx p(x)E(x) - \ 1 dx p{x). (52) 



It is easy to see that the true (canonical) distribution 

e -\ 2 E(x) 

p{x) = Jd^W' (53) 

is indeed an extremum of K[p\. The parameter k has been set equal to 1 and A 2 is 
to be determined from 

-^-ln / dxc- X2E(x) = E. (54) 
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Thus, the classical canonical distribution can be obtained in this simple way from 
a principle of maximal disorder (entropy). Can we use a properly adapted version 
of this powerful principle to solve our present problem ? 

Let us compare the class of theories derived in section [5] with classical theories 
like (|53|) . This may be of interest also in view of a possible identification of 'typi- 
cal quantum mechanical properties' of statistical theories. We introduce for clarity 
some notation, based on properties of the sample space. Classical statistical physics 
theories like (|B"3")) will be referred to as " phase space theories" . The class of statistical 
theories, derived in section [5j will be referred to as "configuration space theories". 

The most fundamental difference between phase space theories and configuration 
space theories concerns the physical meaning of the coordinates. The coordinates x 
of phase space theories are (generally time-dependent) labels for particle properties. 
In contrast, configuration space theories are field theories; individual particles do 
not exist and the (in our case one-dimensional) coordinates x are points in space. 

A second fundamental difference concerns the dimension of the sample space. 
Elementary events in phase space theories are points in phase space (of dimen- 
sion 6 for a 1— particle system) including configuration-space and momentum-space 
(particle) coordinates while the elementary events of configuration space theories 
are (space) points in configuration space (which would be of dimension 3 for a 1— 
particle system in three spatial dimensions). This fundamental difference is a con- 
sequence of a (generally nonlocal) dependence between momentum coordinates and 
space-time points contained in the postulates of the present theory, in particular in 
the postulated form of the probability current [see {J}]. This assumption, a proba- 
bility current, which takes the form of a gradient of a function S (multiplied by p) 
is a key feature distinguishing configuration space theories, as potential quantum- 
like theories, from the familiar (many body) phase space theories. The existence of 
this dependence per se is not an exclusive feature of quantum mechanics, it is a 
property of all theories belonging to the configuration class, including the theory 
characterized by h — 0, which will be referred to as "classical limit theory". What 
distinguishes the classical limit theory from quantum mechanics is the particular 
form of this dependence; for the former it is given by a conventional functional rela- 
tionship (as discussed in section|4]) for the latter it is given by a nonlocal relationship 
whose form is still to be determined. 

This dependence is responsible for the fact that no "global" condition [like ([51]) 
for the canonical distribution (|53|l ] must be introduced for the present theory in 
order to guarantee conservation of energy in the mean - this conservation law can 
be guaranteed "locally" for arbitrary theories of the configuration class by adjusting 
the relation between Q (the form of the dynamic equation) and h (the definition of 
expectation values). In phase space theories the form of the dynamical equations is 
fixed (given by the deterministic equations of classical mechanics). Under constraints 
like (|5ip the above principle of maximal disorder creates - basically by selecting 
appropriate initial conditions - those systems which belong to a particular energy; 
for non-stationary conditions the deterministic differential equations of classical 
mechanics guarantee then that energy conservation holds for all times. In contrast, 
in configuration space theories there are no initial conditions (for particles). The 
conditions which are at our disposal are the mathematical form of the expectation 
values (the function h) and/or the mathematical form of the differential equation 
(the function Q). Thus, if something like the principle of maximal disorder can be 
used in the present theory it will determine the form of the differential equation for 
p rather than the explicit form of p. 

These considerations raise some doubt as to the usefulness of an measure of 
disorder like the entropy (I50|) - which depends essentially on E instead of x and 
does not contain derivatives of p - for the present problem. We may still look for an 
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information theoretic extremal principle of the general form 

I[p] + \iCi[p] — » extremum. (55) 

Here, the functional I[p] attains its maximal value for the function p which describes 
- under given constraints C'i[p] - the maximal disorder. But I[p] will differ from the 
entropy functional and appropriate constraints C;[p], reflecting the local character 
of the present problem, have still to be found. Both terms in ([55]) are at our disposal 
and will be defined in the next sections. 



8 Fisher's information 

A second measure of disorder, besides entropy, exists which is called Fisher informa- 
tion [H] . The importance of this second type of "entropy" for the mathematical form 
of the laws of physics - in particular for the terms related to the kinetic energy - has 
been stressed in a number of publications by Frieden and coworkers [91110] and has 
been studied further by Hall [15], Reginatto [31] and others. The Fisher functional 
I[p] is defined by 

I[p] = Jdxp(x)(^j , (56) 

where p' denotes dp/dx in the present one-dimensional case, and the n— component 
vector Dp = (dp/dxi, . . . , dp/dx n ) if x = (xj., . . . , x n ). Since the time variable t 
does not play an important role it will frequently be suppressed in this section. 

The Boltzmann-Shannon entropy (|50[) and the Fisher information (|56[) have a 
number of crucial statistical properties in common. We mention here, for future 
reference, only the most important one, namely the composition law ()47|) : a more 
complete list of common properties may be found in the literature [43]). Using the 
notation introduced in section [7] [see the text preceeding Eq. (|47p] it is easy to see 
that Eq. (|55|) fulfills the relation 

I[p 1 p 2 ]=I^[p 1 ]+I^[p 2 }, (57) 



in analogy to Eq. 1|47|) for the entropy S. The most obvious difference between ([50]) 
and ([56]) is the fact that ([56]) contains a derivative while ([50]) does not. As a conse- 
quence, extremizing these two functionals yields fundamentally different equations 
for p, namely a differential equation for the Fisher functional / and an algebraic 
equation for the entropy functional S. 

The two measures of disorder, S and /, are related to each other. To find this 
relation, it is necessary to introduce a generalized version of ([50]) . the so-called 
"relative entropy" . It is defined by 

G[p,a}=- [dxp(x)ln^\, (58) 
J a(x) 

where a(x) is a given probability density, sometimes referred to as the "prior" [the 
constant k in Eq. ([50]) has been suppressed here]. It provides a reference point for 
the unknown p\ the best choice for p is to be determined from the requirement of 
maximal relative entropy G[p, a] under given constraints, where a represents the 
state of affairs (or of our knowledge of the state of affairs) prior to consideration of 
the constraints. The quantity —G[p,a] agrees with the "Kullback-Leibler distance" 
between two probability densities p and a [25] , 

It has been pointed out that "all entropies are relative entropies" [5]. In fact, 
all physical quantities need reference points in order to become observables. The 
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Boltzmann-Shannon entropy (|50[) is no exception. In this case, the 'probability 
density' a(x) is a number of value 1, and of the same dimension as p(x); it de- 
scribes absence of any knowledge or a completely disordered state. We mention also 
two other more technical points which imply the need for relative entropies. The 
first is the requirement to perform invariant variable transformations in the sample 
space [5] , the second is the requirement to perform a smooth transition from discrete 
to continuous probabilities [17] . 

Thus, the concept of relative entropies is satisfying from a theoretical point of 
view. On the other hand it seems to be useless from a practical point of view since it 
requires - except in the trivial limit a = 1 - knowledge of a new function a{x) which 
is in general just as unknown as the original unknown function p(x). A way out of 
this dilemma is to identify a(x) with a function p(Tx), which can be obtained from 
p(x) by replacing the argument x by a transformed argument Tx. In this way we 
obtain from (fB"5|) a quantity G[p;pr] which is a functional of the relevant function p 
alone; in addition it is an ordinary function of the parameters px characterizing the 
transformation. The physical meaning of the relative entropy remains unchanged, 
the requirement of maximal relative entropy G[p;pt] becomes a condition for the 
variation of p in the sample space between the points Tx and x. 

If further consideration is restricted to translations Tx = x + Ax (it would be 
interesting to investigate other transformations, in particular if the sample space 
agrees with the configuration space) then the relative entropy is written as 



Expanding the integrand on the r.h.s. of ([55)1 up to terms of second order in Ax 
and using the fact that p and p' have to vanish at infinity one obtains the relation 



This, then is the required relation between the relative entropy G and and the Fisher 
information /; it is valid only for sufficiently small Ax. The relative entropy G[p; Ax] 
cannot be positive. Considered as a function of Ax it has a maximum at Ax = 
(taking its maximal value 0) provided / > 0. This means that the principle of 
maximal entropy implies no change at all relative to an arbitrary reference density. 
This provides no criterion for p since it holds for arbitrary p. But if (|60[) is considered, 
for fixed Ax, as a functional of p, the principle of maximal entropy implies, as a 
criterion for the spatial variation of p, a principle of minimal Fisher information. 

Thus, from this overview (see Frieden's book [TU] for more details and several 
other interesting aspects) we would conclude that the principle of minimal Fisher 
information should not be considered as a completely new and exotic matter. Rather 
it should be considered as an extension or generalization of the classical principle 
of maximal disorder to a situation where a spatially varying probability exists, 
which contributes to disorder. This requires, in particular, that this probability 
density is to be determined from a differential equation and not from an algebraic 
equation. We conclude that the principle of minimal Fisher / is very well suited for 
our present purpose. As a next step we have to set up proper constraints for the 
extremal principle. 

9 Subsidiary condition 




(59) 



G[p; Ax 




(60) 



It will be convenient in the course of the following calculations to write the differ- 
ential equation (|26| in the form 

L(x,t)-L o (p,p',p") = 0, (61) 
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where L(x,t) is given by (|25|) and Lq is denned by 



L (p,p',p") 



(62) 



In (|62|) it has been assumed that Lo does not depend explicitely on x,t, that the 
problem is basically of a time-independent nature, and that no higher derivatives 
with respect to x than p" occur. This last assumption is in agreement with the 
mathematical form of all fundamental differential equations of physics; we shall 
come back to this point later. Our task is to determine the functional form of Lq, 
with respect to the variables p, p', p", using a general statistical extremal principle. 
As a consequence of the general nature of this problem we do not expect the solution 
to depend on the particular form of L(x, t). For the same reason Lq does not depend 



We tentatively formulate a principle of maximal disorder of the form (|55[) and 
identify I[p] with the Fisher functional (|56p. Then, the next step is to find a proper 
constraint C[p\. In accord with general statistical principles the prescribed quantity 
should have the form of a statistical average. A second condition is that our final 
choice should be as similar to the classical requirement (|5Tj) as possible. Adopting 
these criteria one is led more or less automatically to the constraint 



Our guideline in setting up this criterion has been the idea of a prescribed value of 
the average energy; the new term Lq plays the role of an additional contribution 
to the energy. For S(x,t) = —Et + Sq(x) and Lq = the constraint (|rJ5|) agrees 
with (|5Tj) provided the 'classical' identification of p with the gradient of Sq(x) is 
performed [see The most striking difference between (|5Tj) and (|63|) is the fact 
that the quantity L — Lq in (|64|) . whose expectation value yields the constraint, is 
not defined independently from the statistics [like E in (f5Tj) ] but depends itself on 
p (and its derivatives up to second order) . This aspect of non-classical theories has 
already been discussed in section [5J 

Let us try to apply the mathematical apparatus of variational calculus [44] to 
the constraint problem (155)) with the '"entropy"' functional I[p] defined by ([56]) and 
a single constraint defined by (|63[) and (there is no normalization condition 
here because we do not want to exclude potentially meaningful non-normalizable 
states from the consideration) . Here we encounter immediately a first problem which 
is due to the fact that our problem consists in the determination of an unknown 
function Lq of p, p' . p" . This function appears in the differential equation and in the 
subsidiary condition for the variational problem. Thus, our task is to identify from 
a variational problem the functional form of a constraint defining this variational 
problem. Variational calculus, starts, of course, from constraints whose functional 
forms are fixed] these fixed functionals are used to derive differential equations for 
the variable p. Thus, whenever the calculus of variations is applied, the function 
Lq(p, p' , p") must be considered as unknown but fixed. We shall have to find a way 
to 'transform' the condition for the variation of p{x) in a corresponding condition 
for the variation of L (p, p' , p"). 

The variational calculation defined above belongs to a class of 'isoperimetric' 
variational problems which can be solved using the standard method of Lagrange 
multipliers, provided certain mathematical conditions are fulfilled 44J. Analyzing 
the situation we encounter here a second problem, which is in fact related to the 



on S. 



C[p] = 0, 



(63) 



where 




(64) 
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first. Let us briefly recall the way the variational problem (|55p is solved, in partic- 
ular with regard to the role of the Lagrange multipliers A^ [43] . Given the problem 
to find an extremal po(x) of I[p] under m constraints of the form CJp] = and 
two prescribed values of p at the boundaries, one proceeds as follows. The Euler- 
Lagrange equation belonging to the functional ([55]) is solved. The general solution 
for p depends (besides on x) on two integration constants, say C\ and C*2, and on 
the m Lagrange multipliers Ai,...,A m . To obtain the final extremal po(x), these 
m + 2 constants have to be determined from the two boundary values and the m 
constraints (which are differential equations for isoperimetric problems). This is ex- 
actly the way the calculation has been performed (even though a simpler form of 
the constraints has been used) in the classical case. For the present problem, how- 
ever, this procedure is useless, since we do not want the constraints to determine 
the shape of individual solutions but rather the functional form of a term in the 
differential equation, which is then the same for all solutions. For that reason the 
'normal' variational problem (|55|) does not work (we shall come back to a mathe- 
matical definition of 'normal' and 'abnormal' shortly). This means that the classical 
principle of maximal entropy, as discussed in section[7l cannot be taken over literally 
to the non-classical domain. 

For the same reason, no subsidiary conditions can be taken into account in the 
calculations reported by Frieden [9] and by Reginatto [31]. In these works, a different 
route is chosen to obtain Schrodingers equation; in contrast to the present work (see 
below) the Fisher functional is added as a new term to a classical Lagrangian and 
the particular form of this new term is justified by introducing a new " 'principle of 
extreme physical information" ' . 

A variational problem is called 'normal' if an extremal of the functional I[p] + 
XiC[p] (here we restrict ourselves to the present case of a single constraint) exists 
which is not at the same time an extremal of the constraint functional C[p\. If this 
is not the case, i.e. if the extremal is at the same time an extremal of C[p], then 
the problem is called 'abnormal' [44]. Then, the usual derivation becomes invalid 
and the condition (|5"5"]) must be replaced by the condition of extremal C[p] alone, 

\\C\p] — > extremum, (65) 

which then yields 5C[p] = as only remaining condition to determine the extremal. 
This type of problem is also sometimes referred to as " rigid" ; the original formu- 
lation (|55p may be extended to include the abnormal case by introducing a second 
Lagrange multiplier [44j . 

We conclude that our present problem should be treated as an abnormal varia- 
tional problem since we thereby get rid of our main difficulty, namely the unwanted 
dependence of individual solutions on Lagrange multipliers [Ai drops actually out 
of Eq. ([6"5|) ]. A somewhat dissatisfying (at first sight) feature of this approach is 
the fact that the Fisher functional I[p] itself does no longer take part in the vari- 
ational procedure; the original idea of implementing maximal statistical disorder 
seems to have been lost. But it turns out that we shall soon recover the Fisher / in 
the course of the following calculation. The vanishing of the first variation of C[p], 
written explicitely as 

8C[p] =6 J dxp [L(x, t) - L Q (p, p', p")] = 0, (66) 

means that (for fixed Lq) the spatial variation of p should extremize (minimize) 
the average value of the deviation from L(x,t). This requirement is [as a condition 
for p{x)\ in agreement with the principle of minimal Fisher information as a spe- 
cial realization of the requirement of maximal disorder. Eq. (|66[) defines actually a 
Lagrangian for p and yields as Euler-Lagrange equations a differential equation for 
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p. When this equation is derived the task of variational calculus is finished. On the 
other hand, we know that p obeys also Eq. (|61[) . Both differential equations must 
agree and this fact yields a condition for our unknown function L . Eq. (|61[) also 
guarantees that the original constraint (|63|) is fulfilled. In this way we are able to 
'transform' the original variational condition for p{x) in a condition for Lq(p, p' , p"). 
In the next section this condition will be used to calculate Lq and to recover the 
form of the Fisher /. 

It should be mentioned that Eq. ()66|1 has been used many times in the last eighty 
years to derive Schrodinger's equation from the Hamilton-Jacobi equation. The first 
and most important of these works is Schrodinger's "'Erste Mitteilung" ' 34J. In all 
of these papers Lq is not treated as an unknown function but as a given function, 
constructed with the help of the following procedure. First, a transformation from 
the variable S to a complex variable tp' = exp[iS/h] is performed. Secondly, a 
new variable p is introduced by means of the formal replacement ip' ip — pip' . 
This creates a new term in the Lagrangian, which has exactly the form required 
to create quantum mechanics. More details on the physical motivations underlying 
this replacement procedure may be found in a paper by Lee and Zhu [27). It is 
interesting to note that the same formal replacement may be used to perform the 
transition from the London theory of superconductivity to the Ginzburg-Landau 
theory 23J. There, the necessity to introduce a new variable is obvious, in contrast 
to the present much more intricate situation. 



10 Derivation of Lq 

For a general Lq, the Euler-Lagrange equation belonging to the functional C[p] 
[see (|66|) ] depends on derivatives higher than second order since the integrand in (|66p 
depends on p" . This is a problem, since according to the universal rule mentioned 
above all differential equations of physics are formulated using derivatives not higher 
than second order. If we are to conform with this general rule (and we would like 
to do so) then we should use a Lagrangian containing only first order derivatives. 
But this would then again produce a conflict with Eq. (|6Tj) because the variational 
procedure increases the order of the highest derivative by one. We postpone the 
resolution of this conflict and proceed by calculating the Euler-Lagrange equations 
according to (|66|. which are given by 

L(x,t)-L (p,p',p") 

__d^_ 8Lo + ,dLo + ±dLo^ ^£ = (6?) 
dx 2 ^ dp" ^ dp' ^ dx dp' ^ dp 

Using the second basic condition (fBTj) we see that the first line of Eq. (p7)l vanishes 
and we obtain, introducing the abbreviation (3 = pLq, the following partial differ- 
ential equation for the determination of the functional form of Lq with respect to 
the variables p, p' , p" , 

+ Af _f + L (68) 
dx 2 dp" dx dp' dp p 

Expressing the derivatives of L in terms of the derivatives of p, p' and p" leads to 
a lengthy relation which will not be written down here. Since Lq does not contain 
higher derivatives than p", the sums of the coefficients of both the third and fourth 
derivatives of p have to vanish. This implies that [3 may be written in the form 



0{P, p', P") = 0(p, p)p" + D{p, p'), 



(69) 
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where C(p,p') and D(p,p') are solutions of 

dC d 2 C , d 2 D C „ 

" 2 ^- W p+ a(7F + 7 = (7) 

9 2 C^ /)2 | d 2 £> d£> | D _ Q 
<9p 2 dpdp' dp p 

Thus, two functions of p, p' have to be found, instead of a single function of p, p' , p" . 
Fortunately, the solution we look for presents a term in a differential equation. 
This allows us to restrict our search to relatively simple solutions of ([70|) . (jTTj) . 
If the differential equation is intended to be comparable in complexity to other 
fundamental laws of physics, then a polynomial form, 

oo oo 

C{p,p')= Cn,np n ( P T, D( P)P ')= ]T <WV)"\ (72) 

n,m — — oo n,m— — oo 

preferably with a finite number of terms, will be a sufficiently general Ansatz. 

Eqs. (|7D|) and (|7ip must of course hold for arbitrary p, p' . Inserting the Ansatz (f72"l) . 
renaming indices and comparing coefficients of equal powers of p and p' one obtains 
the relations 

(nm + 2n + m + l)c n+ i, m = (m + l)(m + 2)d n , m+2 (73) 
(n + l)(n + 2)c„+2,m-2 = (nm - n + m)d„+i, m (74) 

to determine c„ !m and c?„ !m . These relations may be used to calculate those values of 
n, m which allow for non- vanishing coefficients and to calculate the proportionality 
constants between these coefficients; e.g. Eq. d73l) may be used to express d n , m +2 
in terms of c n +i !m provided m ^= —I and m =/= —2. One obtains the result that the 
general solution /3(p, p' , p") of (|68|) of polynomial form is given by (|69|) . with 

c(p, P ') = E c '«^^)^ ( 75 ) 

D(p,p') = Ap- J] ^^p-'tp')^ 2 , (76) 
A — ' n — I 

where A and C n are arbitrary constants and the index set X is given by 

X={n\neZ,n<0,n>3}. (77) 

While the derivation of (|75|) . ([75)) is straightforward but lengthy, the fact that (|69[) . 
([75]). ([76]) fulfills (Ell) may be verified easily. 

At this point we are looking for further constraints in order to reduce the number 
of unknown constants. The simplest (nontrivial) special case of ([75]) . (f76"|) is 

C n = 0, Vn € T. The corresponding solution for Lq — [3/ p is given by Lq = A. 
However, a solution given by a nonzero constant A may be eliminated by adding a 
corresponding constant to the potential in L(x,t). Thus, this solution need not be 
taken into account and we may set A = 0. 

The 'next simplest' solution, given by A — 0, C n = for all n € X except n = 0, 
takes the form 
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Let us also write down here, for later use, the solution given by A = 0, C n = for 
all n 6 X except n = — 1 . It takes the form 

( P' u 2(^) 3 
3 p- 



Lo=L^=C^ \y (79) 



Comparison of the r.h.s. of Eq. (|78[) with Eq. ([2"5]) shows that the solution ([75]) 
leads to Schrodinger's equation ([32]) . At this point the question arises why this par- 
ticular solution has been realized by nature - and not any other from the huge set of 
possible solutions. Eq. (|78p consists of two parts. Let us consider the two correspond- 
ing terms in pLo, which represent two contributions to the Lagrangian in Eq. (|64|) . 
The second of these terms agrees with the integrand of the Fisher functional (|56|) . 
The first is proportional to p" . This first term may be omitted in the Lagrangian 
(under the integral sign) because it represents a boundary (or surface) term and 
gives no contribution to the Euler-Lagrange equations [it must not be omitted in 
the final differential equation (|6Tj) . where exactly the same term reappears as a 
consequence of the differentiation of p 1 ]. Thus, integrating the contribution pLg of 
the solution ([78]) to the Lagrangian yields exactly the Fisher functional. No other 
solution with this property exists. Therefore, the reason why nature has chosen this 
particular solution is basically the same as in classical statistics, namely the prin- 
ciple of maximal disorder - but realized in a different (local) context and expressed 
in terms of a principle of minimal Fisher information. 

We see that the conflict mentioned at the beginning of this section does not 
exist for the quantum mechanical solution (|78[) . The reason is again that the term 
in Lq containing the second derivative p" is of the form of a total derivative and 
can, consequently, be neglected as far as its occurrence in the term pLo of the 
Lagrangian is concerned. Generalizing this fact, we may formulate the following 
criterion for the absence of any conflict: The terms in Lq containing p" must not 
yield contributions to the variation, i. e. they must in the present context take the 
form of total derivatives (for more general variational problems such terms are called 
'"null Lagrangians" ' [12]). 

So far, in order to reduce the number of our integration constants, we used the 
criterion that the corresponding term in the Lagrangian should agree with the form 
of Fishers functional. This 'direct' implementation of the principle of maximal disor- 
der led to quantum mechanics. The absence of the above mentioned conflict means 
that the theory may be formulated using a Lagrangian containing no derivatives 
higher than first order. As is well known, this is a criterion universally realized in 
nature; a list of fundamental physical laws obeying this criterion may e.g. be found 
in a paper by Frieden and Soffer Thus, it is convincing although of a 'formal' 
character. Let us apply this 'formal' criterion as an alternative physical argument 
to reduce the number of unknown coefficients the above solution. This criterion 
implies, that the derivatives of Lq with respect to p" do not play any role, i.e. the 
solutions of ([68]) must also obey 

da; dp' dp ^ p ^ ^ 

This implies that only those solutions of (p5|) are acceptable, which obey 

0. (81) 



d 2 dp 
dx 2 dp" 



Using ([1)9"]) and ([75]) it is easy to see that the solution ({75]) belonging to n = 
is the only solution compatible with the requirement ([81]) [as one would suspect 
it is also possible to derive ([78]) directly from l|80])]. Thus the 'formal' principle, 
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that the Lagrangian contains no terms of order higher than one, leads to the same 
result as the 'direct' application of the principle of maximal disorder. The deep 
connection between statistical criteria and the form of the kinetic energy terms in 
the fundamental laws of physics has been mentioned before in the literature [lOj . The 
present derivation sheds new light, from a different perspective, on this connection. 

Summarizing, the shape of our unknown function Lq has been found. The result 
for Lq leads to Schrodinger's equation, as pointed out already in section [5j This 
means that quantum mechanics may be selected from an infinite set of possible 
theories by means of a logical principle of simplicity, the statistical principle of 
maximal disorder. Considered from this point of view quantum mechanics is 'more 
reasonable' than its classical limit (which is a statistical theory like quantum me- 
chanics). It also means (see section [5]) that the choice h = is justified as far as the 
calculation of expectation values of p n , n < 2 is concerned. 

In closing this section we note that the particular form of the function L(x, t) has 
never been used. Thus, while the calculation of Lq reported in this section completes 
our derivation of quantum mechanics, the result obtained is by no means specific for 
quantum mechanics. Consider the steps leading from the differential equation (fSTj) 
and the variational principle (|66|) to the general solution (|69|) . ([75]) . (|76p . If we 
now supplement our previous assumptions with the composition law (|47[) . we are 
able to single out the Fisher I among all solutions [compare e.g. (|79|) and ([78]) ]. 
Thus, the above calculations may also be considered as a new derivation of the 
Fisher functional, based on assumptions different from those used previously in the 
literature. 



11 Discussion 



Both the formal transition from classical physics to quantum mechanics (quantiza- 
tion procedure) and the interpretation of the resulting mathematical formalism is 
presently dominated by the particle picture. 

To begin with the interpretation, Schrodinger's equation is used to describe, 
e.g., the behavior of individual electrons. At the same time the statistical nature 
of quantum mechanics is obvious and cannot be denied. To avoid this fundamental 
conflict, various complicated intellectual constructions, which I do not want to dis- 
cuss here, have been - and are being- designed. But the experimental data from the 
micro-world (as interpreted in the particle picture) remain mysterious, no matter 
which one of these constructions is used. 

Let us now consider the quantization process. The canonical quantization proce- 
dure consists of a set of formal rules, which include, in particular, the replacement 
of classical momentum and energy observables p, E by new quantities, according to 

h d ^ h d 
i dx ' i dt ' ' 

which then act on states of a Hilbert state, etc. By means of this well-known set 
of rules one obtains immediately Schrodinger's equation (|32p from the classical 
Hamiltonian of a single particle. While we are accustomed to 'well-established' rules 
like (|82|). it is completely unclear why they work. It does not help if more sophis- 
ticated versions of the canonical quantization procedure are used. If for example, 
the structural similarity between quantum mechanical commutators and classical 
Poisson brackets [35] is used as a starting point, this does not at all change the 
mysterious nature of the jump into Hilbert space given by (|82[) : this structural sim- 
ilarity is just a consequence of the fact that both theories share the same space-time 
(symmetries). 
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Thus, there seems to be no possibility to understand either the quantization 
procedure or the interpretation of the formalism, if a single particle picture is used as 
a starting point. According to a (prevailing) positivistic attitude this is no problem, 
since the above rules 'work' [they illustrate perfectly von Neumann's saying "'In 
mathematics you don't understand things. You just get used to them"']. On the 
other hand, an enormous amount of current activity is apparently aimed at an 
understanding of quantum mechanics. 

We believe that the particle picture is inadequate, both as a starting point for 
the quantization process and with regard to the interpretation of the formalism. 
In fact, both of these aspects seem to be intimately related to each other; a com- 
prehensible quantization procedure will lead to an adequate interpretation and a 
reasonable quantization procedure will only be found if the theory is interpreted in 
an adequate way. The position adopted here is that the most adequate way is the 
simplest possible way. We believe that quantum mechanics is a statistical theory 
whose dynamical predictions make only sense for statistical ensembles and cannot 
be used to describe the behavior of individual events. This ensemble interpretation 
of quantum mechanics (for more details the reader is referred to review articles by 
Ballentine [T] and by Home and Whitaker [16]) is generally accepted as the simplest 
possible interpretation, free from any contradictions and free from any additional 
assumptions expanding the range of validity of the original formalism. Nevertheless 
it is a minority view; the reason may be that it forces us to accept that essential 
parts of reality are out of our control. 

If we accept the ensemble interpretation of quantum mechanics, then the proper 
starting point for quantization must be a statistical theory. The assumption of a 
Hilbert space for the considered system should be avoided. This would mean pos- 
tulating many essential quantum mechanical properties without any possibility to 
analyze their origin. Our aim is the derivation of Schrodinger's equation, from which 
then, afterwards, the Hilbert space structure can be obtained by means of mathe- 
matical analysis, abstraction and generalization. Preferably, Schrodinger's equation 
should be derived from assumptions which can be understood in the framework of 
general classical (statistical as well as deterministic) and logical concepts. This route 
to quantum mechanics is of course not new. Any listing of works 34 ,28,33,32. 9 ,31, 
[2Tl[T4l[TSlfTUll3TIll4Tl[2^] following related ideas must necessarily be incomplete. In the 
present paper an attempt has been undertaken to find a set of assumptions which 
is on the one hand complete and on the other hand as simple and fundamental as 
possible. Throughout this work all calculations have been performed for simplicity 
for a single spatial dimension; however corresponding results will hold without any 
doubt in three-dimensional space (Galilean space-time) as well. 

Our first, and - in a sense - central assumption was the set of relations d2J) , ((3j> , 
which may be characterized as a statistical version of the two fundamental equations 
of classical mechanics displayed in {!]), namely the definition of particle momentum 
and Newton's equation. In writing down these relations the existence of two random 
variables x and p, with possible values from 1Z, has been postulated. This means that 
appropriate experimental devices for measuring position and momentum may be set 
up. The probabilities p(x, t) and w{p, t) are observable quantities, to be determined 
by means of a large number of individual measurements of x and p. Given such data 
for p(x, t) and w(p, t) the validity of the statistical conditions ((2]), Q may be tested. 
Thus, these relations have a clear operational meaning. On the other hand, they do 
not provide a statistical law of nature, i.e. dynamical equations for the probabilities 
p(x,t) and w(p,t). Further constraints are required to define such laws. As shown 
in section[5]the statistical framework provided by ©,([3]) is very general; it contains 
quantum mechanics and its classical limit as well as an infinite number of other 
theories. Thus, it provides a '"bird's eye view"' on quantum theory. 
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Our second postulate was the validity of a continuity equation of the form J7J . 
For the type of theory considered here the validity of a local conservation law of 
probability is a very weak assumption - more or less a logical necessity. The special 
form of the probability current postulated in ([7]) is suggested by Hamilton- Jacobi 
theory (this is of course only an issue for spatial dimensions higher than one). It 
means that an ensemble of particles is considered for which a wave front may be 
defined. A detailed study of such sets of particle trajectories, which are referred to 
as '"coherent systems'", may be found in a review article by Synge [40]. In fluid 
dynamics [7] corresponding fields are called '"potential flow fields'" 

The first two postulates led to an infinite number of statistical theories (coupled 
differential equations for p and S) characterized by an unknown term Lq. Only the 
classical (limit) theory, defined by Lq = 0, allows for an identification of objects 
independent from the statistics; in this case the differential equation for S does not 
depend on p. In all other (non-classical) theories there is a p-dependent coupling 
term preventing such an identification. 

Our third postulate was the assumption that the remaining unknown function 
of p and p', in the coupled differential equations for p and S, takes a form which is 
in agreement with the principle of maximal disorder (or minimal knowledge). This 
logical principle of simplicity is well known, in the form of a postulate of maximal 
entropy, from statistical thermodynamics. In the present case it has to be imple- 
mented in a different and more complicated way, as a postulate of minimal Fisher 
information. This is due to the fact that the present equation for the determination 
of p is a differential equation, i.e. it may not only depend on p but also on deriva- 
tives of p. The entropy is a functional which depends only on p and is unable to 
adjust properly with regard to this new 'degree of freedom'. Our analysis started in 
section [7] with a discussion of the conventional principle of maximal entropy and led 
to the variational principle (I66|) in section [9l which formally describes the principle 
of maximal disorder in the present context. Finally, in section \10\ Eq. (|66| has been 
used to determine the unknown term, which leads to Schrodinger's equation. 

Schrodinger's equation says nothing about the calculation of expectation values 
of p— dependent quantities. To reproduce this part of the quantum mechanical for- 
malism, we had to implement a further requirement, namely conservation of energy 
in the mean. From this fourth assumption the standard quantum mechanical result 
could be recovered for terms of the form p n , where n = 0,1, 2. These terms are 
not the only ones occurring in realistic situations. If we want to study the behavior 
of charged particles in a magnetic field we should be able to calculate expectation 
values of terms of the form ps(x), where s(x) is an arbitrary function of x. Such 
terms (and more generally the inclusion of gauge fields) will be dealt with in future 
work. 

Summarizing, the most important relations of quantum mechanics have been 
derived from assumptions which may be characterized either as purely statistical or 
as statistical versions (or continuum versions) of relations of particle physics. The 
continuity equation and the principle of maximal disorder belong to the former class. 
The statistical conditions, conservation of energy in the mean, and the special form 
of the probability current belong to the latter class. These statistical assumptions 
imply quantum mechanics and are much simpler to understand than the jump into 
Hilbert space given by Eq. I|82p. Of course, all of these assumptions are relations or 
structural properties belonging to the quantum-mechanical formalism; it would not 
be possible to derive quantum mechanics from assumptions which are not quantum- 
mechanical in nature. However, it is not trivial that these, relatively simple and 
comprehensible assumptions are sufficient to derive the basic relations of the whole 
formalism. 

As far as the interpretation of the formalism is concerned, we believe that the 
above derivation of the most basic equations of quantum mechanics from statistical 
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assumptions presents a strong argument in favor of a statistical (ensemble) interpre- 
tation. This becomes even more evident, if the relation between quantum mechanics 
and its classical limit is considered in detail. As discussed in section 0] the transition 
from the classical to the quantum mechanical theory is characterized by the elim- 
ination of a deterministic element, namely the (deterministic) functional relation 
between position and momentum variables. Thus, quantum mechanics contains less 
deterministic elements (it is 'more statistical' in nature) than its classical limit, a 
result in accordance with the general classification scheme set up in section [2] This 
loss of determinism is implicitly contained in the above assumptions and presents 
the essential " 'non-classical" ' element of the present derivation. 

It is interesting to compare the present derivation with other derivations of 
Schrodinger's equation making use of different "'non-classical elements'" [13" j l27 | . 
The loss of determinism mentioned above is also responsible for the crucial role of 
the concept of Fisher information in the present work. This concept was realized 
here in a way different from the one followed previously by Frieden and others |10[ 
[TT|[T3]. On the other hand, several aspects of the present work may also be seen as 
a complement to this previous approach. In particular, the 'classical limit theory' 
which is used as a starting point by these authors may be derived from the first two 
assumptions of the present work. 

12 Concluding remarks 

The present derivation was based on the assumption that dynamical predictions are 
only possible for statistical averages and not for single events - leaving completely 
open the question why predictions on single events are impossible. This deep ques- 
tion remains unanswered; some speculative remarks on a possible source of the 
indeterminacy have been given elsewhere [24| . 

Throughout this work it has been assumed that p is not a (in single events) 
observable quantity but plays the role of a probability density. There are of course 
other possibilities, besides this 'probabilistic' (or 'immaterial') interpretation of p = 
l^l 2 . According to the original point of view of Schrodinger and de Broglie, |^>| 2 was 
an observable field measuring something like the density distribution of an 'extended 
particle'. More recently, this interpretation has been reconsidered in an interesting 
paper by Barut [3]. But the results of modern high-precision measurements |42j 
strongly support the probabilistic interpretation and exclude, in our opinion, the 
original 'material' view of Schrodinger and de Broglie. 

A third interpretation for p is possible. It could play the role of a density of a 
stream of particles in the framework of an approximate continuum theory. This p 
would also be an observable quantity. An interesting problem for future research 
is the question if the present derivation can be adapted in such a way that this 
interpretation of p makes sense. The resulting Schrodinger equation ([32]) would be 
a classical field equation despite the fact that it contains an adjustable parameter 
h. However, it would be only approximately true (what is actually observed are 
particles and not fields) and could, therefore, play a role as a starting point for a 
procedure known as second quantization. 

'Interaction between individual objects' and the corresponding notion of force 
are macroscopic concepts. In the microscopic domain, where according to the present 
point of view only statistical laws are valid, the concept of force looses its meaning. 
In fact, in the quantum- mechanical formalism 'interaction' is not described in terms 
of forces but in terms of potentials (as is well known, this leads to a number of subtle 
questions concerning the role of the vector potential in quantum mechanics). The 
relation between these two concepts is still not completely understood; it seems that 
the present statistical approach offers a new point of view to study this problem. 



28 



In a previous work |24j of the present author, Schrodinger's equation has been 
derived from a different set of assumptions including the postulate that the dynamic 
equation of state may be formulated by means of a complex-valued state variable 
tp. The physical meaning of this assumption is unclear even if it sounds plausible 
from a mathematical point of view. The present paper may be seen as a contin- 
uation and completion of this previous work, insofar as this purely mathematical 
assumption has been replaced now by other requirements which may be interpreted 
more easily in physical terms. Finally, we mention that there are, besides the points 
mentioned already, several other open problems for future research, extending the 
range of validity of the present approach. These include the consideration of gauge 
fields, a generalization of the present formalism to many particles, and a relativistic 
formulation. 
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